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I Introduction 


Following  previous  studies  (Samejima,  1977b,  1977d,  1978a, 

1978b,  1978c,  1978d) , the  present  paper  proposes  another  method  for 
estimating  the  operating  characteristics  of  the  graded  items 
(Samejima,  1969,  1972),  in  which: 

(1)  no  mathematical  forms  are  assumed  for  the  operating  charac- 
teristics ; 

(2)  a relatively  small  number  of  examinees  are  used  in  the 
whole  process  of  estimation; 

(3)  the  maximum  likelihood  estimates  of  the  examinees  are  available 
because  of  a previous  testing;  and 

(A)  the  test  which  was  used  for  (3)  has  a constant  test  inform- 
ation function  for  the  range  of  ability  of  our  interest,  and 
the  amount  of  information  is  substantially  large. 

The  process  of  the  maximum  likelihood  estimation  using  the  response 
pattern  of  a specified  examinee  has  been  described  (Samejima,  1969). 

The  asymptotic  property  of  the  maximum  likelihood  estimate  (Samejima, 
1975),  i.e.,  the  normality  of  its  conditional  distribution,  given 
ability,  with  the  given  ability  itself  and  the  inverse  of  the  test 
information  function  as  its  two  parameters,  is  fully  used.  Since 
the  constancy  of  the  test  information  has  many  theoretical  advantages 
(Samejima,  1977a),  the  estimation  of  the  conditional  moments  of  ability, 
given  its  maximum  likelihood  estimate,  is  relatively  easy*,  provided 
that  the  density  function  of  the  maximum  likelihood  estimate  is  given. 

By  virtue  of  the  concept  of  weakly  parallel  tests  (Samejima,  1977c), 
the  results  are  directly  comparable  with  those  of  similar  studies  using 


* Refer  to  Appendix  II. 


I 

I 

1 

i 

I 

f 

different  sets  of  test  items  but  following  the  same  principles , Cl) 
through  (4). 

i 

The  actual  data  used  in  the  present  study  are  simulated  data 
of  five  hundred  hypothetical  examinees,  and  our  target  is  to  estimate 
the  item  characteristic  function,  i.e.,  the  operating  characteristic 
of  the  item  score  1 when  the  item  is  binary,  of  each  of  the  ten 
binary  items.  A brief  description  of  these  data  is  given  as  Appendix 
I.  The  ten  binary  items  follow  the  normal  ogive  model,  and  the  two 
parameters,  discrimination  and  difficulty  indices,  are  given  in  Tables 
6-3,  6-4,  etc.,  for  each  item. 

Throughout  the  study,  the  method  of  moments  (Elderton  and  Johnson, 

1969,  Johnson  and  Kotz,  1970)  is  frequently  used.  Among  others,  the 
density  function  of  the  maximum  likelihood  estimate  is  approximated 
by  a polynomial  of  degree  3 or  4, applying  the  method  of  moments  for 
the  set  of  500  maximum  likelihood  estimates. 
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II  Conditional  P .D.F.  Approach 

Conditional  P.D.F.  has  been  presented  and  developed  (Samejima, 
1978a,  1978b,  1978d) . Let  f(8)  be  the  density  function  of  ability, 
or  latent  trait,  9 , 0 | 9)  be  the  conditional  density  of  § , 

given  9 , which  is  approximated  by  n(0,o2)  where  cr2  is  the 

inverse  of  the  test  information  function,  1(9)  = 21.63  , and  g(§) 

* 

is  the  density  function  of  the  maximum  likelihood  estimate,  9 . 

We  can  write  for  the  conditional  density  function  of  § , given  9 , 

4><e|§)  = 1 0)  f (0)  [ I iK§|0) 

/ -oo 

= iK§ie)  f (9)  [g(§> r1  . 


(2.1) 


f (0)  d0] 


-1 


Let  P (6)  be  the  operating  characteristic  of  the  graded  item  score 

Xg 

category  x . The  estimate  of  the  operating  characteristic,  P (0)  , 
8 8 
in  the  Conditional  P.D.F.  Approach  can  be  written  in  the  general  form 

M 

,-l 


(2.2) 


P (9) 
x 

8 


i w(9  ) <)>(0|e  ) I i w(e  ) «i>( 0 1 0 ) ] 


seG 


s=l 


8 


where  G indicates  the  group  of  examinees  having  the  item  score  x , 

*8  „ 8 
N (-500)  is  the  number  of  examinees,  and  w(0  ) is  the  weight 

assigned  to  the  maximum  likelihood  estimate  of  the  examinee  s , When 

this  weight  is  constant  for  all  the  examinees,  (2.2)  is  called  Simple 

Sum  Procedure,  to  distinguish  itself  from  the  Weighted  Sum  Procedure 

(Samejima,  1978d) . 

To  use  (2.2),  it  is  necessary  to  obtain  the  estimated  conditional 
density  of  0 , given  0 , and  so  far  three  methods,  l.e.,  Pearson 
System  Method,  Normal  Approach  Method  and  Two-Parameter  Beta  Method, 
have  been  used.  In  the  first  method,  4>(9|^)  is  approximated  by 


» 

) 

i 

i 

I 

t ' 

one  of  the  Pearson  System  density  functions,  depending  upon  the  value 
of  Pearson's  criterion  < , which  is  obtained  from  the  first  four 

f 

conditional  moments  of  0 , given  8 . In  the  second  method,  <|>(0 1 8 ) 
is  approximated  by  a normal  density  function,  using  the  first  two 
conditional  moments  of  0 , given  9 , as  the  parameters.  In  the 

third  method,  we  use  a Beta  density  function  for  $( 0 | 0 ),  with  a 1 

s 

priori  set  two  parameters,  which  are  the  two  endpoints  of  the  interval 
for  which  the  Beta  density  function  assumes  positive  values,  and  the 
other  two  parameters  estimated  from  the  first  two  conditional  moments 
of  0 , given  0 . (For  further  detail,  cf.  Samejima,  1977d,  1978a, 

J 

1978b,  1978d) . > 

In  the  present  study,  the  concept  of  Conditional  P.D.F.  Method  ; 

is  expanded  and  generalized  to  mean  that  the  estimated  conditional 
density  $( 0 | 0)  is  fully  utilized,  without  using  any  estimated  joint 
density  of  0 and  9 . In  the  following  section,  therefore,  a new 

procedure  is  proposed  which  is  not  based  on  (2.2).  . 

r 

* 


i 
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III  Proportioned  Sum  Procedure 


Let  p(seG  ) be  the  probability  with  which  the  examinee  s 

Xg 

belongs  to  the  group  whose  item  score  for  item  g is  x , and 

E 


p(seG  ) be  its  estimate.  In  the  present  method,  the  estimated 

Xg 

operating  characteristic,  P (0)  , is  defined  by 

Xg 

N N 

P„  (6)  = £ p(seG  ) $(0 | 8 ) [ E $(0|8  )]~1  , 

X - X o , b 

g S=1  g s=l 


(3.1) 


where 


(3.2) 


E 8 p(seG  ) * E 6 p(seG  ) = 1 

v°  Xs  v°  X8 


m 

E 8 


The  main  difference  between  (3.1)  and  (2.2)  is  that  in  the  numerator 
of  (3.1)  the  estimated  conditional  densities  for  all  the  8 's 


participate,  while  that  of  (2.2)  includes 


) for  the  examinees 


who  belong  to  the  group  G only.  The  procedure  represented 

Xg 

by  (3.1)  will  be  called  Proportioned  Sum  Procedure  of  the  Conditional 


P.D.F.  Approach. 

Since  our  data  used  in  the  present  study  are  for  estimating 
the  operating  characteristics  of  binary  items,  it  is  sufficient  to 
estimate  the  item  characteristic  function,  i.e.,  the  operating 


characteristic  of  x ■ 1 , of  each  item.  Let  P (0)  be  the 

g g 


item  characteristic  function  of  item  g 
such  that 


We  can  rewrite  (3.1) 


N 


N 


(3.3) 


P (6)  - E p(seG)  $(0 1 0 ) [ E 4>(0 1 6 ) 1 
8 s-1  8 s-1  8 


-1 


where  G is  the  group  of  examinees  whose  item  score  for  item  g 
is  1 . 


I 

\ 

I 

t 

\ 

r 

The  estimated  probability,  p(sEG)  , or  the  proportioned 
weight,  in  the  present  study  is  given  by  the  proportion  of  the 

i 

examinees  who  belong  to  the  group  G within  a specified  interval 

whose  midpoint  is  0g  . This  interval  is  more  or  less  arbitrary, 

and  we  have  taken  6 ± o2  and  § ± 2u2  , where  a2  is  the 
s s 

inverse  of  the  constant  test  information  (=21.63,  cf.  Appendix  I) 
and  also  the  second  parameter  of  the  conditional  distribution  of 
0 , given  0 . Thus  these  intervals  are  ± 0.215  and 

0g  ± 0.430,  respectively. 

If  we  replace  $( 0 1 0 ) in  (3.3)  by  4>( 0 1 0 ) , then  we  will 

s s 

obtain  the  estimated  item  characteristic  function  which  indicates 
the  maximal  possible  attainment  for  the  present  procedure  and  data. 

This  function  is  not  obtainable  from  empirical  data,  and  only  the 
simulation  study  can  provide  us  with  it . As  usual,  we  shall  call 
it  the  criterion  item  characteristic  function.  Since  we  adopted 
two  different  sets  of  proportioned  weights,  as  was  mentioned  in 
the  preceding  paragraph,  we  shall  obtain  two  criterion  item  character- 
istic functions  for  each  item. 

For  the  same  reason  described  in  the  preceding  study  (Samejima,  - r„ 

1978d) , we  use  the  two  methods, i.e. , Pearson  System  Method  and  Two- 
Parameter  Beta  Method,  for  obtaining  <t>(0|0g),  and  exclude  Normal 
Approach  Method. 


i 


IV  Results  1^:  Proportioned  Weights 

Figure  4-1  presents  the  estimated  probability,  or  proportioned 

weight,  p(s£G)  , using  each  of  the  two  intervals,  0 1 0,215  and 

s 

A A 

0 ± 0.430  , for  each  of  the  500  maximum  likelihood  estimates  9 
s s 

We  can  see  that  these  two  sets  of  results,  plotted  by  solid  triangles 
and  crosses  respectively,  are  for  the  most  part  overlapping,  except 
for  some  extreme  values  of  0g  . This  finding  suggests  that  we  are 
likely  to  obtain  similar  estimated  item  characteristic  functions 
as  the  results  of  using  these  two  sets  of  proportioned  weights. 

It  should  be  noted  that  these  proportioned  weights  adopted 
in  the  Proportioned  Sum  Procedure  in  the  present  study  themselves 
can  be  regarded  as  estimates  of  the  item  characteristic  function 
of  each  binary  item.  Since  we  have  the  relationship  (Samejima, 
1977b) 


(4.1) 


Var. (0)  = Var , (0)  + a2  , 


and  we  obtain 


(4.2) 


E(0)  - 0 , 


(4.3) 


Var. (0) 


/: 


25 


02  f (0)  d0  “ 0.2  [-±-  0: 


2.5 


-2.5 


12 


2.083333  , 


(4.4) 
and 

(4.5) 


Var.(S)1/2  i [2.083333  + 0.046225]1/2  - 1.459301 


Var. (0) 1/2  i 1.443376  , 


it  is  expected  that  the  abscissa  be  stretched  by  approximately  1.1  K 
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for  these  estimates.  By  virtue  of  the  fact  that  this  percentage 
is  negligibly  small,  in  each  graph  of  Figure  4-1,  the  theoretical 
item  characteristic  function,  which  follows  the  normal  ogive  model 
with  the  two  parameters  shown  in  Tables  6-3,  6-4,  etc.,  is  also 
drawn,  using  the  axis  of  § as  if  it  were  the  axis  of  ability  9 . 
As  we  can  see  in  these  graphs,  these  proportional  weights  themselves 
are  fairly  close  to  the  item  characteristic  functions. 

It  should  be  added  that  the  estimated  probability,  or  propor- 
tioned weight,  for  the  item  score  category  0 is  given  by 


(4.6) 


p(seG)  » 1 - p(scG)  , 


where  G is  the  group  of  examinees  who  did  not  answer  item  g correctly, 
or  the  failure  group.  These  ten  graphs  in  Figure  4-1  present, 
therefore,  this  other  set  of  proportioned  weights  for  each  of  the  two 
cases  also,  if  we  read  each  value  from  the  top  of  the  graph  downwards. 

A question  arises  as  to  whether  it  is  necessary  to  use  all  the 
500  proportioned  weights,  p(seG)  , in  the  estimation  of  the  item 
characteristic  function.  It  will  be  simpler  and  easier  if  we  can 
reduce  this  number  substantially.  For  this  reason,  we  experiment 
using  61  points  of  6 starting  with  0 « -3.0  and  ending  with 
9 “ 3.0  with  the  step  of  0.1  , Instead  of  the  500  values  of  , 

and  compute  the  values  of  p(scG)  in  the  same  way.  The  two  sets 
of  results  are  plotted  in  Figure  4-1  by  dotted  and  dashed  curves, 
respectively.  We  can  see  that  these  curves  overlap,  mostly,  with 
correspoinding  sets  of  plots  for  the  500  §s’s  , as  is  naturally 
expected . 


- ■ . - - t 


fir  .■ 
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V Results  II:  Estimated  Density  Functions  of  Ability 

Figure  5-1  presents  the  estimated  density  function,  ?(0)  , 

obtained  by  the  Proportioned  Sum  Procedure,  using  the  500  0 ’s  > 

s 

by  dotted  line,  which  is  also  the  denominator  of  (3.3)  with  <j>(0|§s) 
replaced  by  its  theoretical  counterpart,  4>( 0 1 ®s)  » an<*  -*-s  divided  by 

the  total  number  of  examinees,  500  . Note  that  this  estimated 
density  function  is  the  same  as  the  one  obtained  by  the  Simple  Sum 
Procedure  (Samejima,  1978d)  of  the  Conditional  P,D,F,  Approach, 

For  comparison,  the  estimated  density  function  of  0 obtained  by 
the  Pseudo  Criterion  Degree  4 Case  of  the  Weighted  Sum  Procedure 
(Samejima,  1978d)  is  drawn  by  broken  and  dotted  line,  together  with 
the  theoretical  density  function,  f(0)  . We  can  see  that  the  f(8) 
obtained  by  the  Proportioned  Sum  Procedure  is  reasonably  close  to 
the  theoretical  density  function,  although  not  as  close  as  the  one  by  the 
Pseudo  Criterion  Degree  4 Case  of  the  Weighted  Sum  Procedure, 

As  an  additional  information,  the  same  denominator  computed 
for  the  61  points  of  0 and  divided  by  61  is  presented  in  the  same 
figure  by  dashed  line.  This  is  by  no  means  an  estimate  of  the  density 

function  f ( 0)  , and  it  is  interesting  to  note  that  the  curve  is 
substantially  different  from  the  others. 

The  mean  square  error  which  is  obtained  from 

(5.1)  -±-  ^ [ f ( 0^ ) - f (6j ) ] 2 , 

where  m » 25  and  ®j's  are  “2.4  through  2.4  with  the  step  0.2  is 
computed,  and  it  turned  out  to  be  0.00018  (0.01324),  which  is  a little 
greater  than  0.00009  (0.00972)  for  the  £(0)  by  the  Pseudo  Criterion 
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Degree  4 Case  of  the  Weighted  Sum  Procedure.  (The  numbers  in  parentheses 
are  the  corresponding  square  roots  of  the  mean  square  errors.)  For 
an  additional  information,  if  we  calculate  the  value  given  by  (5.1) 
for  the  61  point  case,  it  will  be  0.00350  (0.05918), 

The  estimated  shared  density  function  (Samejima,  1978d)  for 
each  of  the  failure  and  success  groups  of  each  item  is  presented 
as  Figure  5-2,  for  the  two  criterion  cases  of  the  present  study, 
together  with  the  one  obtained  by  the  Simple  Sum  Procedure,  the  actual 
frequency  ratios  of  0 and  the  theoretical  shared  density.  (Again, 
for  an  additional  information,  the  corresponding  curves  for  the  61 
point  case  are  also  presented  in  the  same  figure,  which  are  by  no 
means  estimated  shared  densities.)  We  find  that  these  two  curves  are 
fairly  close  to  the  one  by  the  Simple  Sum  Procedure,  which  indicates 
that  they  are  not  very  close  to  the  three  curves  provided  by  the 
Pseudo  Criterion  Degree  3,  4 and  5 Cases  of  the  Weighted  Sum  Procedure 
(Samejima,  1978d) . 

Figure  5-3  presents  the  estimated  density  functions,  £(0)  , 
obtained  by  Degree  3 and  4 Cases*  of  the  Pearson  System  Method.  We 
can  see  that  these  results,  especially  the  one  obtained  by  Degree  3 

l 

Case,  are  very  similar  to  the  one  obtained  by  the  criterion  case. 

The  means  square  errors  of  the  ?(0)'s  obtained  by  Degree  3 
and  4 Cases  of  the  Pearson  System  Method  and  their  square  roots  were 
calculated  through  (5,1),  and  they  are  0.00076  (0.02752)  and  0.00105 
(0,03242)  respectively.  As  was  expected,  they  are  reasonably  small, 
but  not  as  small  as  those  for  the  criterion  case,  which  turned  out 


* Refer  to  Appendix  II. 
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FIGURE  5-2 

Comparison  of  the  Four  Estimated  Shared  Density  Functions  of  Ability 
of  Each  of  the  Two  Item  Score  Groups,  Which  are  Obtained  by  the  Proportioned 
Sum  Procedure,  Using  the  True  Conditional  Density  Function,  $(0|8  ), 
with  the  One  Obtained  by  the  Simple  Sum  Procedure.  Theoretical8 
Shared  Density  Function  and  Actual  Frequency  Ratios  Are  Also 

Presented . 
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FIGURE  5-2:  Criterion  Case  (Continued) 
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FIGURE  5-2:  Criterion  Case  (Continued) 
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FIGURE  5-2:  Criterion  Case  (Continued) 
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to  be  0,00018  (0,01324),  as  was  mentioned  earlier, 

The  estimated  shared  density  functions  by  the  failure  and  success 
groups  obtained  by  Degree  3 and  4 Cases  of  the  Pearson  System  Method 
for  the  two  intervals  of  each  are  presented  in  Figure  5-4,  along 

with  those  obtained  by  the  criterion  case  of  the  Simple  Sum  Procedure, 
the  actual  frequency  ratios  and  the  theoretical  shared  density  functions. 

We  find  that  the  two  curves  for  the  different  proportioned  weights 
are  very  close  to  each  other,  in  each  of  Degree  3 and  4 Cases. 

Figure  5-5  presents  the  estimated  density  functions,  ?(0)  , 
obtained  by  Degree  3 and  4 Cases  of  the  Two-Parameter  Beta  Method, 

Just  like  those  obtained  by  the  Pearson  System  Method,  these  two 
curves  have  "tails"  outside  the  interval  of  0 , (-2,5,  2,5)  , for 

which  the  theoretical  density,  f(0)  , assumes  a constant  positive 
value,  0,2  , It  is  noticed  that  within  the  interval  of  0 these 
two  curves  are  very  close  to  each  other,  and,  in  fact,  more  so  than 
those  obtained  by  the  Pearson  System  Method. 

The  mean  square  errors  for  these  two  estimated  density  functions 
from  the  theoretical  density  functions,  which  were  computed  through 
(5,1),  turned  out  to  be  0.00077  (0,02766)  and  0,00064  (0.02529)  for 

. 

Degree  3 and  4 Cases,  respectively.  They  are  both  comparable  to 
those  by  Degree  3 Case  of  the  Pearson  System  Method,  i.e,,  0,00076 
(0.02752) . 

The  estimated  shared  density  functions  of  0 by  the  failure 
and  success  groups  obtained  by  the  Two-Parameter  Beta  Method  are 
presented  as  Figure  5-6,  by  the  four  curves  each  of  which  represents 
Degree  3 Case  with  the  interval  ± 0.215,  Degree  3 Case  with 

0 ± 0,430,  Degree  4 Case  with  0 ± 0,215,  or  Degree  4 Case  with 

s s 


FIGURE  5-4:  Pearson  System  Method  (Continued) 


FIGURE  5-4:  Pearson  System  Method  (Continued) 


FIGURE  5-4:  Pearson  System  Method  (Continued) 
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FIGURE  5-4:  Pearson  System  Method  (Continued) 


FIGURE  5-4:  Pearson  System  Method  (Continued) 


FIGURE  5-4:  Pearson  System  Method  (Continued) 


FIGURE  5-4:  Pearson  System  Method  (Continued) 
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FIGURE  5-4:  Pearson  System  Method  (Continued) 
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0 ± 0.430  , Presented  in  the  same  figure  are  those  obtained  by 

s 

the  criterion  case  of  the  Simple  Sum  Procedure,  the  actual  frequency 
ratios  and  the  theoretical  shared  densities.  Just  as  we  have  observed 
in  the  results  obtained  by  the  Pearson  System  Method,  in  each  of  Degree 
3 and  4 Cases,  the  two  curves  for  the  estimated  shared  density  functions 
using  the  two  different  intervals  of  0g  in  obtaining  the  propor- 
tioned weights  are  very  close  to  each  other.  It  is  also  noticed 
that  the  set  of  four  curves  for  each  item  is  not  too  different  from 
the  set  of  those  obtained  by  the  Pearson  System  Method,  which  are 
presented  in  Figure  5-4. 
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FIGURE  5-6:  Two-Parameter  Beta  Method  (Continued) 


FIGURE  5-6:  Two-Parameter  Beta  Method  (Continued) 


FIGURE  5-6:  Two-Parameter  Beta  Method  (Continued) 


FIGURE  5-6:  Two-Parameter  Beta  Method  (Continued) 
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VI  Results  III ; Estimated  Item  Characteristic  Functions 

Figure  6-1  presents  the  resultant  criterion  item  characteristic 
functions  using  two  different  intervals,  6 ± 0.215  and  § ± 0,430  , 

in  calculating  the  proportioned  weights,  for  each  of  the  ten  binary 
items,  together  with  the  actual  frequency  ratios  of  0 and  the 
theoretical  item  characteristic  function.  In  the  same  figure,  also 
presented  are  the  two  criterion  item  characteristic  functions,  which 
were  obtained  using  the  61  points  of  § instead  of  the  500  §s's  • 

It  is  easily  seen  that,  for  all  the  items,  these  four  criterion  item 
characteristic  functions  are  closer  to  the  item  characteristic  functions 
than  the  actual  frequency  ratios  of  0 , and  yet  the  influence  of 
the  actual  frequency  ratios  on  these  criterion  item  characteristic 
functions  is  obvious  in  the  way  they  are  deviates}  from  the  theoretical 
item  characteristic  functions. 

It  should  be  noted  that  for  each  item  these  four  criterion 
item  characteristic  functions  are  very  close  to  one  another,  whether 
the  500  ®s's  are  used  or  only  the  61  points  are  used  for  computing 
the  proportioned  weights.  This  fact  is  an  interesting  finding, 
since  the  estimated  density  functions  of  0 and  the  estimated 
shared  density  functions  of  0 by  the  failure  and  success  groups  are 
substantially  different  between  the  500  case  and  the  61  point 

case,  as  we  can  see  in  Figures  5-1  and  5-2. 

The  mean  square  errors  of  the  criterion  item  characteristic 
functions  from  the  theoretical  item  characteristic  function  for  each 
of  the  ten  binary  items,  which  are  obtained  by 


- I [P  (0,)  - P (0 .) ]‘ 

j_l  g J 8 j 


(6.1) 


m 


i 
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where  m = 25  and  G^'s  are  -2,4  through  2,4  with  the  step  of  0.2  , 
were  computed  for  each  criterion  item  characteristic  function  of 
each  item,  and  these  values  and  their  square  roots  are  presented 
in  Tables  6-1  and  6-2, 

It  is  noticed  that  among  these  four  sets  of  mean  square  errors, 
or  their  square  roots,  there  are  no  substantial  differences,  which 
indicates  that  these  four  sets  of  criterion  item  characteristic 
functions  are  just  as  good  as  one  another.  There  is  even  a slight 
tendency  that  the  two  sets  for  the  61  point  case  have  smaller  values 
than  those  for  the  500  §g's  case,  which  encourages  the  idea  of 
reducing  the  number  of  points  of  the  maximum  likelihood  estimate 
for  which  the  conditional  density,  <J>(6|6)  > is  estimated  and  used 
for  the  estimation  of  the  operating  characteristics. 

The  discrimination  parameter,  a^  , and  the  difficulty  parameter, 
b^  , in  the  normal  ogive  model*  were  estimated  by  the  usual  least 
square  method  for  each  of  the  ten  binary  items,  and  are  presented 
as  Tables  6-3  and  6-4.  It  is  obvious  that  these  estimated  para- 
meters are  just  as  good  as  those  obtained  by  the  Simple  Sum  Procedure, 
which  are  also  presented  in  these  tables,  if  not  substantially  better. 
This  Indicates  that  they  are  also  comparable  with  those  obtained  from 
the  Pseudo  Criterion  Item  Characteristic  Functions  of  Degree  3,  4 and 
5 Cases,  as  we  can  see  in  one  of  the  previous  studies  (Samejima,  1978d). 
As  before,  in  the  present  results  of  the  Proportioned  Sum  Procedure, 
in  general,  the  estimates  of  the  difficulty  parameters  b^  are  closer 
to  the  true  values  than  those  of  the  discrimination  parameters  a . 

O 


* Refer  to  Appendix  I. 
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FIGURE  6-1:  Criterion  Case  (Continue 
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FIGURE  6-1:  Criterion  Case  (Continued) 
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FIGURE  6-1:  Criterion  Case  (Continued) 
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Mean  Square  Errors  of  the  Estimated  Item  Characteristic  Functions  Against 
the  True  Item  Characteristic  Function.  These  Functions  Are  the  Four 
Criterion  Item  Characteristic  Functions,  Those  Obtained  by  the 
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TABLE  6-3 

Estimated  Item  Discrimination  Parameters,  a , Obtained  from  the 
Four  Criterion  Item  Characteristic  Functionl  of  the  Proportioned 
Sum  Procedure  of  the  Conditional  P.D.F.  Approach,  with  the  One 
from  the  Criterion  Item  Characteristic  Function  by  the  Simple  Sum 
Procedure  (SSP) , for  Each  of  the  Ten  Binary  Items. 
Estimation  Is  Based  on  the  Least  Squares  Principle, 

Using,  at  Most,  25  Points  of  6 in  the  Interval, 

J^-2.4,  2.4],  Excluding  the  Points  for  Which 

P (6)  Is  Less  Than  0. 05  or  Greater  Than  0.95. 

g 


* 


ITEM 

TRUE 

CRITERION 

SSP 

CRITERION 

500 

RLE 

61 

RLE 

Inttl 

£ 0.430 

2 0.215 

2 0.430 

1 

m 

1.330  s 

1.064  s 

1.683  5 

1.327  5 

1.400  5 

2 

1.023 

C.970 

0.545 

0.535 

1.024 

3 

2.5 

1.740 

1.642 

1.755 

1.660 

1.788 

4 

1.0 

0.873 

0.821 

0.854 

0.836 

0.868 

5 

1.5 

1.377 

1.286 

1.372 

1.303 

1.368 

6 

1.0 

0.854 

0.882 

0.872 

0.872 

0.855 

7 

2.0 

1.475 

1.415 

1.42C 

1.405 

1.473 

8 

1.0 

0.510 

0.873 

0.885 

G.516 

0.886 

5 

2.0 

1.708 

1.522 

1.741 

1.653 

1.716 

10 

1.0 

0.757 

C.140 

C.757 

0.765 

0.725 

THE  NUMBER  OF  INTERVALS  USEO  IN  ESTIMATION  IS 
SHOWN  AS  A SUBSCRIPT  WHEN  IT  IS  LESS  THAN  6. 
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TABLE  6-4 

Estimated  Item  Difficulty  Parameters,  b , Obtained  from  the 
Four  Criterion  Item  Characteristic  Functions  of  the  Proportioned 
Sum  Procedure  of  the  Conditional  P.D.F.  Approach,  with  the  One 
from  the  Criterion  Item  Characteristic  Function  by  the 
Simple  Sum  Procedure  (SSP) , for  Each  of  the  Ten  Binary 
Items.  Estimation  Is  Based  On  the  Least  Squares 
Principle,  Using,  at  Most,  25  Points  of  6 in  the 
Interval,  [-2.4,  2.4],  Excluding  the  Points  for 

Which  P (0)  Is  Less  Than  0.05  or  Greater  Than  0.95. 

g 


ITEM 

TRUE 

CRITERION 

500 

MLE 

61 

MLE 

SSP 

tm 

* 0.215 

t C.430 

CRITERION 

1 

-2.5 

— 2.6925 

-2.657,. 

-2.467 

s 

-2.638 

s 

-2.651 

$ 

2 

1 

M 

• 

O 

-2.001 

-2.C31 

-2.C75 

-2.036 

-2.002 

2 

-1.5 

-1.512 

-1.526 

-1.509 

-1.513 

-1.507 

4 

0 
• 

pH 

1 

-C.997 

-1.030 

-1.C16 

-1.022 

-1.005 

5 

-0.5 

-0.471 

-0.468 

-0.454 

-0.467 

-0.472 

6 

o 

« 

o 

-0.063 

— 0.061 

-0.C83 

-C.C70 

-0.075 

7 

0.5 

0.522 

C.522 

0.533 

0.527 

0.527 

8 

1.0 

0.576 

0.577 

0.551 

0.940 

0.981 

5 

1.5 

1.518 

1.511 

1.49C 

1.496 

1.502 

10 

2.0 

2.116 

2.157 

2.048 

2.080 

2.118 

THE  NUMBER  OF  INTERVALS  USED  IN  ESTIMATION  IS 
SHOWN  AS  A SUBSCRIPT  WHEN  IT  IS  LESS  THAN  6. 
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As  was  the  case  of  the  Simple  Sum  and  Weighted  Sum  Procedures,  we 

notice  that  the  estimates  of  the  discrimination  parameters  a are 

8 

further  from  the  true  values  when  a 's  are  large,  like  for  items 

8 

3,  7 and  9.  It  is  interesting  to  note  that,  although  the  parameter 
estimation  tends  to  be  more  inaccurate  for  very  difficult  and  very 
easy  items  in  most  cases,  this  tendency  is  ameliorated  when  the 
61  point  case  is  used,  in  both  the  discrimination  and  difficulty 
parameter  estimations. 

Figure  6-2  presents  the  resulting  estimated  item  characteristic 
functions  obtained  by  Degree  3 and  4 Cases  of  the  Pearson  System  Method, 

A 

each  of  which  has  two  results  using  the  intervals,  0g  ± 0.215  and 
0g  ± 0.430  , in  calculating  the  proportioned  weights,  respectively. 

In  the  same  figure,  for  each  item,  also  presented  are  the  criterion 
item  characteristic  function  obtained  by  the  Simple  Sum  Procedure, 
the  actual  frequency  ratios  of  0 and  the  theoretical  item  characteristic 
function.  We  notice  that  these  four  curves  for  each  item  are  fairly 
close  to  one  another,  and  also  to  the  criterion  item  characteristic 
function  obtained  by  the  Simple  Sum  Procedure,  for  the  interval  of  0 , 
(-2.5,  2.5).  It  is  also  noted  that  these  estimated  item  character- 
istic functions  are  similar  to  the  corresponding  criterion  item  charac- 
teristic functions,  if  we  compare  Figure  6-2  with  Figure  6-1. 

The  mean  square  errors  and  their  square  roots,  which  were 
calculated  through  (6.1),  are  shown  in  Tables  6-1  and  6-2,  respectively. 

We  can  see  that  these  values  are  very  close  to  those  for  the  criterion 
item  characteristic  functions,  which  are  presented  in  the  same  tables. 
There  are  no  tendencies  that  those  obtained  by  Degree  4 Case  are  smaller 
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than  those  obtained  by  Degree  3 Case  of  the  Pearson  System  Method, 
or  vice  versa. 

Tables  6-5  and  6-6  present  the  estimated  discrimination 

a A 

parameter,  a , and  difficulty  parameter,  b , by  the  least 
S 6 , 

square  method,  for  each  of  the  ten  binary  items,  in  comparison 
with  those  obtained  from  the  criterion  item  characteristic  function 
by  the  Simple  Sum  Procedure.  These  results  are  very  similar  to 
those  obtained  from  the  criterion  item  characteristic  functions, 
and  the  same  tendencies  as  were  described  earlier  for  the  estimates 
from  the  criterion  item  characteristic  functions  are  also  observed 
among  these  estimates,  although  the  latter  tend  to  be  slightly  more 
inaccurate  than  the  former.  It  is  interesting  to  note  that  the 
estimates  of  the  discrimination  parameters  in  Degree  3 Case  with 
the  interval  0g  +0.215  tend  to  be  better  than  the  others,  although 
the  differences  are  small. 

Figure.  6-3  presents  the  estimated  item  characteristic  functions 

of  each  of  the  ten  binary  items  obtained  by  Degree  3 and  4 Cases  of 

the  Two-Parameter  Beta  Method,  each  having  two  functions  by  using 

the  two  intervals,  § +0.215  and  0 +0.430  for  calculating  the 

s — S ” 

proportioned  weights,  along  with  the  criterion  item  characteristic 
function  by  the  Simple  Sum  Procedure  of  the  Conditional  P.D.F.  Approach, 
the  actual  frequency  ratios  of  0 and  the  theoretical  item  character- 
istic function.  Again,  these  resultant  functions  are  close  to  one 
another,  to  the  criterion  item  characteristic  function  by  the  Simple 
Sum  Procedure,  and  to  the  criterion  item  characteristic  functions  by 
the  present  procedure,  which  are  presented  in  Figure  6-1,  for  the 
interval  of  0 , (-2.5,  2.5). 
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Procedure.  Actual  Frequency  Ratios  and  Theoretical  Item 
Characteristic  Function  Are  Also  Presented  for  Comparison. 
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FIGURE  6-2:  Pearson  System  Method  (Continued) 
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FIGURE  6-2:  Pearson  System  Method  (Continued) 
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FIGURE  6-2:  Psarson  System  Method  (Continued) 
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FIGURE  6-2:  Pearson  System  Method  (Continued) 
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FIGURE  6-2:  Pearson  System  Method  (Continued) 


TABLE  6-5 

Estimated  Item  Discrimination  Parameters,  a , Obtained  from 
the  Four  Estimated  Item  Characteristic  Functions  by  Degree  3 
and  4 Cases  of  the  Pearson  System  Method  (PSM)  of  the  Proportioned 
Sum  Procedure  of  the  Conditional  P.D.F.  Approach , with  the 
One  Obtained  from  the  Criterion  Item  Characteristic  Function 
by  the  Simple  Sum  Procedure  (SSP)  , for  Each  of  the  Ten  Binary 
Items.  Estimation  Is  Based  on  the  Least  Squares  Principle, 
Using,  at  Most,  25  Points  of  0 in  the  Interval,  [-2.4,  2.4] 
Excluding  the  Points  for  Which  P (0)  Is  Less  Than  0.05  or 

Greater  Than  ® 0.95  . 


ITEM 

TRUE 

PSM 

SSP 

CRITERION 

DGR. 

3 

DGR. 

4 

♦ 0.215 

t 0.430 

* 0.215 

± 0.430 

1 

1.5 

1.123  5 

0.976  5 

1.022  5 

0. 957  s 

1.400  5 

2 

1.0 

1.048 

0.975 

1.072 

0.983 

1.024 

3 

2.5 

1.741 

1.638 

1.759 

1.641 

1.788 

4 

1.0 

0.886 

0.828 

0.884 

0.822 

0.868 

5 

1 .5 

1.391 

1 .298 

1 .377 

1.285 

1.368 

6 

1 .0 

0.904 

0.891 

0.848 

0.884 

0.895 

7 

2.0 

1.486 

1.424 

1.477 

1.416 

1.473 

8 

1 .0 

0.698 

0.863 

0.883 

0.851 

0.886 

9 

2.0 

1.666 

1.492 

1.642 

1.482 

1.716 

10 

1.0 

0.  742 

0.727 

0.736 

0.  718 

0.725 

THE  NUMBER  OF  INTERVALS  USED  IN  ESTIMATION  IS 
SHOWN  AS  A SUBSCRIPT  WHEN  IT  IS  LESS  THAN  6. 


TABLE  6-6 


Estimated  Item  Difficulty  Parameters,  b , Obtained  from  the 
Four  Estimated  Item  Characteristic  Functions  by  Degree  3 and 
4 Cases  of  the  Pearson  System  Method  (PSM)  of  the  Proportioned 
Sum  Procedure  of  the  Conditional  P.D.F.  Approach,  with  the 
One  Obtained  from  the  Criterion  Item  Characteristic  Function 
by  the  Simple  Sum  Procedure  (SSP)  , for  Each  of  the  Ten  Binary 
Items.  Estimation  Is  Based  on  the  Least  Squares  Principle, 
Using,  at  Most,  25  Points  of  9 in  the  Interval,  [-2.4,  2. A], 
Excluding  the  Points  for  Which  P (0)  Is  Less  than  0.05  or 

Greater  Than  ® 0,95  . 


ITEM 

TRUE 

FSM 

DGR. 

3 

DGR. 

A 

SSP 

CRITERION 

♦ 0.215 

♦ 0.430 

■MM 

mrnmm 

t 0.430 

1 

-2,5 

-2.833 

-2.533 

—2.952 

—2.568 

-2.651 _ 

5 

s 

5 

5 

5 

2 

SZ9 

-1.570 

-2.015 

-1.560 

-2.019 

-2.002 

3 

-1.5 

-1.500 

-1.517 

-1.507 

-1.525 

-1.507 

A 

-1.0 

-0.584 

-1.023 

-0.584 

-1.027 

-1.005 

5 

-0.5 

-0.467 

-0.464 

-0.471 

-0.468 

-0.472 

6 

0.0 

-0.062 

-0.061 

-0.088 

-0.060 

-0.075 

7 

0.5 

0.517 

0.317 

0.522 

C.522 

0.527 

e 

1.0 

0.583 

0.582 

0.556 

0.994 

0.981 

9 

D 

1.515 

1.509 

1.533 

1.521 

1.502 

10 

2.134 

2.172 

2.154 

2.198 

2.118 

THE  NUMBER  CF  INTERVALS  USED  IN  ESTIMATION  IS 
SHOWN  AS  A SUBSCRIPT  WEEN  IT  IS  LESS  THAN  6. 
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FIGURE  6-3:  Two-Parameter  Beta  Method  (Continued) 
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FIGURE  6-3:  Two-Parameter  Beta  Method  (Continued) 
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FIGURE  6-3:  Two-Parameter  Beta  Method  (Continued) 
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FIGURE  6-3:  Two-Parameter  Beta  Method  (Continued) 
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TABLE  6-7 

Estimated  Item  Discrimination  Parameters,  a , Obtained  from 
the  Four  Estimated  Item  Characteristic  Funt&ions  by  Degree  3 
and  4 Cases  of  the  Two-Parameter  Beta  Method  (TPBM)  of  the  Propor- 
tioned Sum  Procedure  of  the  Conditional  P.D.F.  Approach,  with 
the  One  Obtained  from  the  Criterion  Item  Characteristic 
Function  by  the  Simple  Sum  Procedure  (SSP) , for  Each  of 
the  Ten  Binary  Items.  Estimation  Is  Based  on  the  Least 
Squares  Principle,  Using,  at  Most,  25  Points  of  0 in 
the  Interval,  [-2.4,  2.4],  Excluding  the  Points  for 

Which  f (9)  Is  Less  Than  0.05  or  Greater  Than  0.95. 

g 


ITEN 

TRUE 

TPBM 

SSP 

CRITERION 

DGR. 

3 

DGR. 

4 

± 0.215 

♦ 0.430 

i 0.215 

t 0.430 

1 

1.5 

1. 10  5 ^ 

0.579s 

1.135s 

C.980$ 

1.4°°5 

2 

1.0 

1.045 

0.574 

1.055 

C .581 

1.024 

3 

2.5 

1.739 

1.636 

1.754 

1.644 

1.788 

A 

1.0 

0.B86 

O.E27 

0.675 

0.821 

0.868 

5 

1.5 

1.369 

1.298 

1.375 

1.284 

1.368 

6 

1.0 

0.904 

0.891 

0.656 

0.883 

0.895 

7 

2.0 

1.488 

1.424 

1.475 

1.415 

1.473 

8 

1.0 

0.856 

0.864 

0.501 

0.866 

0.886 

9 

2.0 

1.674 

1.493 

1.7CC 

1.510 

1.716 

10 

1.0 

0.740 

0.728 

0.749 

0.738 

0.725 

THE  NUMBER  OF  INTERVALS  USED  IN  ESTIMATION  IS 
SHOWN  AS  A SUBSCRIPT  WFEN  IT  IS  LESS  THAN  6. 
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TABLE  6-8 

Estimated  Item  Difficulty  Parameters,  6g  , Obtained  from  the 
Four  Estimated  Item  Characteristic  Functions  by  Degree  3 and 
4 Cases  of  the  Two-Parameter  Beta  Method  (TPBM)  of  the  Propor- 
tioned Sum  Procedure  of  the  Conditional  P.D.F.  Approach,  with 
the  One  Obtained  from  the  Criterion  Item  Characteristic 
Function  by  the  Simple  Sum  Procedure  (SSP) , for  Each  of 
the  Ten  Binary  Items.  Estimation  Is  Based  on  the  Least 
Squares  Principle,  Using,  at  Most,  25  Points  of  0 in 
the  Interval,  [-2.4,  2.4] , Excluding  the  Points  for 

Which  P (6)  Is  Less  Than  0.05  or  Greater  Than  0.95. 

g 


ITEM 

TRUE 

TPBM 

SSP 

CRITERION 

DGR.3 

DGR.4 

BBS 

♦ 0.430 

t 0.215 

t 0.430 

1 

-2.5 

-2.839 

-2.931 

-2.827 

-2.938 

-2.651 

5 

5 

5 

5 

5 

2 

-2.0 

-1.972 

-2.015 

-1.973 

-2.019 

-2.002 

3 

-1.5 

-1.500 

-1. 516 

-1.508 

-1. 524 

-1.507 

4 

-1.0 

-0.986 

-1.023 

-0.991 

-1.026 

-1.005 

5 

-0.5 

-0.467 

-0.463 

-0.471 

-0.466 

-0.472 

6 

0.0 

-0.061 

-0.060 

-0.062 

-0.060 

-0.075 

7 

0.5 

0.518 

0.518 

0.523 

0.522 

0.527 

8 

1 .0 

0.985 

0.982 

0.981 

0.982 

0.981 

9 

mwm 

1.520 

1. 509 

1.523 

1.514 

1.502 

10 

2.138 

2.171 

2.130 

2.  162 

2.118 

THE  NUMBER  OF  INTERVALS  USED  IN  ESTIMATION  IS 
SHOWN  AS  A SUBSCRIPT  WHEN  IT  IS  LESS  THAN  6. 


The  mean  square  errors,  and  their  square  roots,  of  these 
estimated  item  characteristic  functions  from  the  true  item  character- 
istic functions,  which  are  obtained  through  (6,1),  are  presented  in 
Tables  6-1  and  6-2,  Again  these  results  are  similar  to  those  obtained 
by  the  Pearson  System  Method,  and  those  from  the  criterion  item 
characteristic  functions  by  the  Simple  Sum  Procedure  and  from  those 
by  the  Proportioned  Sum  Procedure, 


The  estimated  discrimination  and  difficulty  parameters,  a^  and 

6 , are  presented  in  Tables  6-7  and  6-8.  They  are  comparable  to 

8 

those  obtained  by  the  Pearson  System  Method,  and,  again,  there  is  a 
slight  tendency  that  the  estimates  of  the  discrimination  parameters 

A 

are  better  for  Degree  3 Case  with  the  interval,  9 ± 0,215, 

s 


I 


t 

VII  Discussion  and  Conclusion 

k 

Conditional  P.D.F.  Approach  was  expanded,  and  a new  procedure, 

( 

i.e.,  Proportioned  Sum  Procedure,  was  introduced,  and  tried  on  the 
simulated  data,  which  have  been  used  to  test  many  different  methods 
and  approaches.  As  the  result,  we  have  obtained  sufficiently  good 
estimates  of  the  item  characteristic  functions  of  the  ten  binary 
items.  They  are  similar  to  the  results  obtained  in  previous  studies, 
using  Simple  Sum  Procedure  and  Weighted  Sum  Procedure,  however,  and 
we  cannot  say  that  the  present  method  is  superior  to  the  others. 

Thus  we  ended  up  with  expanding  our  repertory  of  methods  of 
estimating  the  operating  characteristics  of  the  graded  item 
categories . 

One  Interesting  finding  is  that,  using  as  small  a number  of 
points  of  0 as  61,  we  have  succeeded  in  producing  criterion  item 
characteristic  functions  which  are  practically  the  same  as  those 
obtained  on  the  500  ^s's  • This  result  is  all  the  more  interesting 
if  we  call  our  attention  to  the  fact  that  neither  of  the  numerator 
and  the  denominator  of  (3.3),  which  are  divided  by  61,  is  close  to 
the  density  function  of  6 and  the  shared  density  function  of  0 

f 

by  the  success  group,  as  we  can  see  in  Figures  5-1  and  5-2.  Although 
it  is  too  early  to  jump  to  the  conclusion,  if  this  is  further  tested 
with  different  types  of  data  and  proves  to  be  successful,  this  will 
contribute  to  saving  our  time  and  trouble  in  the  estimation  process. 

In  such  a case,  the  set  of  a certain  larger  number  of  maximum  likelihood 
estimates  is  used,  mainly,  for  obtaining  the  estimates  of  the  moments, 
and  to  specify  a polynomial  of  a certain  degree  using  the  method  of  moments 


to  approximate  the  density  function  of  the  maximum  likelihood,  g(0)  , 
for  all  the  conditional  moments  of  ability  0 , given  0 , are 

A . 

estimated  from  g(0)  and  cr  (cf.  Appendix  II). 

As  was  the  case  in  most  of  the  previous  studies,  we  have  found 
no  substantial  differences  between  the  set  of  results  obtained  by 
Degree  3 Case  and  that  by  Degree  4 Case,  in  spite  of  the  fact  that 
these  two  polynomials  of  degree  3 and  4 are  substantially  different 
(cf.  Appendix  II).  It  seems  that  the  accuracy  of  estimation  of 

A 

the  density  function  g(0)  is  not  too  important,  although  we  should 
try  not  to  generalize  this  tentative  conclusion  too  far.  Again,  we 
find  no  significant  differences  between  the  results  obtained  by  the 
Pearson  System  Method  and  the  Two-Parameter  Beta  Method,  and  both 
of  them  proved  to  be  very  close  to  the  criterion  item  characteristic 


functions. 
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A-I  Simulated  Data 

The  simulated  data  used  In  the  present  study  are  characterized 
as  follows. 


(1)  There  are  500  hypothetical  examinees. 

(2)  Their  ability,  or  latent  trait,  distributes  uniformly  for  the 

interval  of  8 , (-2.5,  2.5).  Actually,  we  use  100  discrete 

points  of  0 , such  as  -2.475,  -2.425,  -2.375,  -2.325,  

2.375,  2.425  and  2.475,  i.e.,  the  midpoints  of  the  100  subintervals 
with  the  width  of  0.05,  and  at  each  point  five  examinees  are 
located. 

(3)  There  is  a hypothetical  test  of  35  graded’ items,  each  of  which 
has  four  item  score  categories , and  which  provides  us  with  an 
approximately  constant  test  information  function,  21.63,  for 

the  interval  of  0 , [-3.0,  3.0],  following  the  normal  ogive 

model  of  the  graded  response  level  (Samejima,  1969,  1972).  The 
test  is  called  the  Old  Test,  to  distinguish  from  the  New  Test, 
which  will  be  described  later. 

(4)  Each  of  the  500  examinees  is  assumed  to  have  taken  the  Old 
Test,  and  his  response  pattern  on  the  35  graded  items  has  been 
calibrated  by  the  Monte  Carlo  method.  The  score  categories  of 
each  item  are  0,  1,  2 and  3,  and  a typical  response  pattern 
looks  like:  (3, 3, 3, 2, 3, 3, 2, 2, 2, 2, 2, 2, 1,2, 2, 2, 1,2, 1,1, 1,0, 1,1, 

1,0, 1,0, 1,1, 0,0, 0,0,0). 

(5)  From  each  response  pattern,  the  maximum  likelihood  estimate  of  the 
examinee's  ability  has  been  obtained,  using  a computer  program 
written  for  this  purpose.  In  this  process,  out  of  140  basic 
functions  (Samejima,  1969,  1972),  an  appropriate  set  of  35  basic 
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functions  are  chosen  depending  upon  the  item  scores  In  the 

response  pattern,  and,  using  the  Newton-Raphson  procedure,  I 

the  point  of  0 at  which  the  sum  total  of  these  35  basic 
functions  equals  zero  is  searched. 

r 

(6)  There  is  another  hypothetical  test  of  10  binary  items,  each 
of  which  follows  the  normal  ogive  model  of  the  dichotomous 
response  level.  This  is  called  the  New  Test. 

(7)  Each  of  the  500  examinees  is  assumed  to  have  taken  the  New 
Test  also,  and  his  response  pattern  on  the  New  Test  has  been 
calibrated  by  the  Monte  Carlo  method.  A typical  response 
pattern  looks  like:  (1,1, 1,0, 1,0,0, 1,0,0) . 

(8)  The  item  characteristic  functions  of  the  test  items  of  the 
New  Test  are  assumed  to  be  unknown,  and  they  are  the  target 

of  estimation.  Each  method  of  estimation  is  evaluated  by  the 


the  "closeness"  of  the  resultant  estimated  item  characteristic 
functions  to  the  true  item  characteristic  functions.  i,e.. 
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A-Il  Conditional  Moments  and  Degree  3,  4 and  5 Cases 

Let  <f>(0|  §)  be  the  conditional  density  function  of  ability  0 , 
given  its  maximum  likelihood  estimate,  § . We  can  write 


i 

(A. 2.1) 


/OO 

jK0|e>  f<0)  d0] 
- <K0|e)  f(0)  tg(6) ]-1  , 


| A 


where  i^(O|0)  is  the  conditional  density  function  of  0 , given  0 , 

A 

f(0)  is  the  probability  density  function  of  ability  0 , and  g(0) 
is  the  probability  density  function  of  the  maximum  likelihood  estimate 

A 

0 . 

By  virtue  of  the  asymptotic  property  of  the  maximum  likelihood 
estimate  and  the  constancy  of  the  test  information  function  of  the 
Old  Test,  the  conditional  density  4>( 6 1 0)  is  approximated  by  the 
normal  density  n(0,o2),  where  a2  » (21.63)"1  - 0.046225  . Because 
of  this  fact,  although  the  density  function  4>( 0 1 0)  given  by  (A. 2.1) 
is  not  observable  in  the  empirical  situation,  it  is  possible  to 
estimate  the  conditional  moments  of  ability  0 , given  the  maximum 

A A 

likelihood  estimate  0 , provided  that  the  density  function  g(0) 
is  estimated.  That  is  to  say,  we  can  derive  the  following  equations. 


i 

! 

1 


(A. 2. 2) 
(A. 2. 3 ) 


(A. 2. 4) 


E(0|0)  A 0 + o2-  log  g(§)  - 0 + 02[-^-  g(§)Hg(§)]"1. 

2 

Var.  < 0 1 0)  i o2ll  + o2  -jp  log  g(0)] 

- o2[i  + o2{-^g(0)-g(o)  - [75-  g(0>]2Hg<e>>"2]. 

E[{0  - E(0 1 0)  } * 1 0]  * log  g(0>] 

■ -o*[  (g(@)  }2*7$'i  g(0)  - 3 g(0)  *7$"  *7^2 

+ 2{-^-  s(6>  >*Hs(d> >“31  . 
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(A.2.5)  E[{0  - E ( 0 1 0)  } **  | 0 J ■ O **  f 3 + 6 a2  log  g(0)}  + 3o4{-^"2  log  g(0)}2 

+ log  g(6)}  ] 

- o“[3  + 6cr2{[g(§)  g(§)  - g(§) }2][g(§)]  2} 

+ 3o“  [g(§)-^g(§)  - g(§)}2]2[g(e)]_1 

+ o4[  (g(0) } g(0)  - 4{g(0)  }2,-jjjp  g(§)  g(9) 

- 3{g(0)  }2{-^~p  g(§)  }2  + 12  g(®)  g(®)  gt®) 

- 6{-^*  8<§>}4ng(0>r4]. 

A 

Thus  It  is  obvious  that  we  need  an  estimate  of  g(0)  in  order 
to  obtain  the  above  four  conditional  moments.  This  is  done  by  approx- 
imating g(6)  by  a polynomial  of  degree  3,  4 or  5 using  the  method 
of  moments  (Elderton  and  Johnson,  1969,  Johnson  and  Kotz,  1970). 

These  three  polynomials  obtained  for  the  five  hundred  maximum  likelihood 
estimates,  which  have  been  used  as  our  simulated  data,  are  as  follows. 

(A. 2. 6)  g(8)  - 0.22416  - 0.003510  - O.O187302  + 0.000950* 

(A. 2. 7)  g(0)  - 0.19620  + 0.002380  + O.O131902  - 0.000620*  - 0.004270“ 

(A. 2. 8)  g(§)  . 0.19539  - 0.006380  + O.O144902  + 0.004050*  - 0.004490“ 

- 0.000480s 

We  distinguish  three  different  cases,  depending  upon  the  polynomial 
used  for  g(0)  , and  call  them  Degree  3,  4 and  5 Cases.  Since  the 
polynomial  of  degree  5,  shown  as  (A. 2. 8),  is  close  to  that  of  degree  4, 
in  many  cases  we  deal  only  with  Degree  3 and  4 Cases. 
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